User Expectations from Dictation on Mobile Devices

نویسندگان

  • Santosh Basapur
  • Shuang Xu
  • Mark Ahlenius
  • Young Seok Lee
چکیده

Mobile phones, with their increasing processing power and memory, are enabling a diversity of tasks. The traditional text entry method using keypad is falling short in numerous ways. Some solutions to this problem include: QWERTY keypads on phone, external keypads, virtual keypads on table tops (Seimens at CeBIT ‘05) and last but not the least, automatic speech recognition (ASR) technology. Speech recognition allows for dictation which facilitates text input via voice. Despite the progress, ASR systems still do not perform satisfactorily in mobile environments. This is mainly due to the complexity of capturing large vocabulary spoken by diverse speakers in various acoustic conditions. Therefore, dictation has its advantages but also comes with its own set of usability problems. The objective of this research is to uncover the various uses and benefits of using dictation on a mobile phone. This study focused on the users’ needs, expectations, and their concerns regarding the new input medium. Focus groups were conducted to investigate and discuss current data entry methods, potential use and usefulness of dictation feature, users’ reaction to errors from ASR during dictation, and possible error correction methods. Our findings indicate a strong requirement for dictation. All participants perceived dictation to be very useful, as long as it is easily accessible and usable. Potential applications for dictation were found in two distinct areas namely communication and personal use.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mobidic - a mobile dictation and notetaking application

Mobile devices have become ubiquitous and reasonably powerful and well connected. However, their physical size limits possibilities of interaction, especially document creation. Dictation in mobile setting provides one solution, but limited processing power requires that the actual speech recognition is distributed to a server computer. We present MobiDic, a distributed mobile dictation applica...

متن کامل

Interactive ASR Error Correction for Touchscreen Devices

We will demonstrate a novel graphical interface for correcting search errors in the output of a speech recognizer. This interface allows the user to visualize the word lattice by “pulling apart” regions of the hypothesis to reveal a cloud of words simlar to the “tag clouds” popular in many Web applications. This interface is potentially useful for dictation on portable touchscreen devices such ...

متن کامل

Very large vocabulary voice dictation for mobile devices

This paper deals with optimization techniques that can make very large vocabulary voice dictation applications deployable on recent mobile devices. We focus namely on optimization of signal parameterization (frame rate, FFT calculation, fixedpoint representation) and on efficient pruning techniques employed on the state and Gaussian mixture level. We demonstrate the applicability of the propose...

متن کامل

Information Leakage through Mobile Motion Sensors: User Awareness and Concerns

Smart phones and wearable devices have replaced personal computers and desktops as the primary platform for accessing online applications and services. However, these mobile devices bring forth new and additional forms of security and privacy risks, which were non-existent in traditional personal computers. For instance, several recent research efforts have shown that motion sensors such as acc...

متن کامل

Will Input Style Affect Mandarin Short Messages in Mobile Device?: a Wizard of Oz Study

Speech input is a natural text entry method for handheld devices that are used in different contexts. We conducted an experiment to understand effects of input (speaking) style (phrasal vs. sentence input) on Chinese text entry rates and user satisfaction with other two variables: recognition rate (50%, 70% and 90%) and message length (10 vs. 20 characters). Wizard of Oz was applied in the expe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007